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We consider polymers in which M randomly selected pairs of monomers are restricted to be in 
contact. Analytical arguments and numerical simulations show that an ideal (Gaussian) chain of 
N monomers remains expanded as long as M -C N; its mean squared end to end distance growing 
as r 2 oc M/N. A possible collapse transition (to a region of order unity) is related to percolation 
in a one dimensional model with long-ranged connections. A directed version of the model is also 
solved exactly. Based on these results, we conjecture that the typical size of a self-avoiding polymer 
is reduced by the links to 7? > {NjM) v . The number of links needed to collapse a polymer in three 
dimensions thus scales as N^, with <j> > 0.43. 
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I. INTRODUCTION 



Polymers subject to both repulsive and attractive self- 
interactions may have different equilibrium states de- 
pending on temperature. For example, a homogeneous 
self-avoiding polymer with short-range attractions be- 
tween its monomers undergoes a collapse transition |l]] 
from an expanded to a compact state as the temperature 
is lowered through a "0-point" . In the expanded state, 
the radius of gyration (root-mean-squared size) R g of 
the polymer scales as N v , where N is the number of 
monomers, and v (approximately 0.588 in d = 3 dimen- 
sions 0) is the swelling exponent. The compact state 
has a finite density, and hence v = 1/3. Upon collapse, 
contacts in the polymer (pairs of monomers located adja- 
cent to each other in the embedding space) both increase 
drastically, and change qualitatively. The description of 
such changes constitutes another important characteris- 
tic of the collapse transition. 

Statistics of such contacts is even more important in 
the characterization of the equilibrium states of hetero- 
geneous polymers H such as biomolecules. Dynamics 
of protein folding is also influenced by contacts between 
aminoacids (see, e.g. [El). While in problems of this type 
the contacts between monomers are temporarily gener- 
ated in the process of thermodynamic equilibrium, it is 
interesting to ask the inverse question of whether the con- 
figuration of a polymer can be described by specifying 
permanent contacts between its monomers. The issue of 
permanent contacts has also been extensively addressed 
in the context of rubber elasticity and the vulcanization 
process l^), where the typical situation is a polymer 
melt with permanent crosslinks between the polymers. 

Gutin and Shakhnovich (GS) || analyzed the problem 
of a single polymer chain in which pairs of monomers 
are forced to remain adjacent to each other, i.e. are per- 
manently linked. They note that in order to establish 



a meaningful relation between the distribution of such 
contacting pairs, and the behavior of real self-interacting 
polymers without permanent links, the choice of the en- 
semble of contacts must be very non-random: The num- 
ber of random selections of contacting pairs of monomers 
is of the order of N N , while the number of spatial config- 
urations increases only exponentially with N. It is thus 
not possible to find a simple correspondence between the 
two random ensembles. GS suggest that a correlated se- 
lection of constrains is needed to generate appropriate 
spatial configurations. 

Recently, Bryngelson and Thirumalai (BT) [jl0| con- 
sidered a related problem in which links are randomly 
established between pairs of monomers on an ideal (i.e. 
non-self-interacting) polymer. The unconstrained ideal 
chain is expanded with R g oc N 1 / 2 . By comparing vari- 
ational estimates to the free energies of expanded and 
collapsed states, BT argue that increasing the number of 
(uncorrelated) links causes the polymer to collapse into a 
state in which R g is independent of N. In particular, they 
conclude that for a generic set of constraints, in which 
the typical distance £ (measured along the backbone) be- 
tween linked monomers is of the order N, it suffices to 
have a negligible density of the constrains (~ 1/hxN) for 
such a collapse to occur. 

In this work we consider several models of polymers 
with randomly linked monomers. An essential feature of 
all these models is that the links along the polymer are se- 
lected in uncorrelated fashion. Unlike the previous work 
of BT, which concentrated on estimates of the free energy, 
we directly measure the spatial extent of the polymer. 
For ideal chains, we derive exact lower bounds which 
prove that uncorrelated links cannot cause the polymer 
to collapse. This is confirmed by extensive numerical 
simulations. Based on these results, we conjecture that, 
quite generally, the presence of M random links reduces 
the typical size of a swollen polymer to R > (N/M) v . 
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For ideal chains, a collapse (to R ~ 0(1)) occurs only 
when the number of links is of the order of N. However, 
it should be easier to collapse self-avoiding chains to a 
compact globular state with R oc A 1 / 3 . 

The remainder of the paper is organized as follows. 
The simplest model of an ideal chain with permanent 
links is introduced in Sec. 0. We show that calculation 
of the squared end to end distance is equivalent to deter- 
mining the resistance of a related resistor network. This 
equivalence provides a powerful numerical tool that is 
exploited in the following sections. Section III focuses 
on the case where the contacts are formed only between 
nearby monomers. We demonstrate that even a finite 
density of such contacts does not lead to a collapsed state. 
Analogies to percolation in one dimension suggest that 
collapse requires contacts between far away points. Ac- 
cordingly, in Sec. IV, we consider links between any two 
randomly chosen points on the chain. In this limit, a rig- 
orous lower bound indicates that a finite contact density 
is necessary to cause a collapse. Several features of these 
models are shared by a simple directed polymer with ran- 
domly forced links to the origin. As discussed in Sec. [v|, 
this model has the advantage of being exactly solvable. 
Further discussions of the results, and their possible ex- 
tension to self-avoiding polymers, appear in conclusion 
(Sec. 0). 



II. MODEL 



Following BT, we use a discrete Gaussian chain of N+l 
monomers, subject to a Hamiltonian 
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(1) 



to describe the ideal polymer. Here fi is the position of 
the ith monomer in the d dimensional embedding space, 
j3 = 1/(&bT), and k is the inverse of the mean-squared 
distance between adjacent monomers (Kuhn length). We 

next select M pairs of monomers l&j 1 ', &j- 2 ^}i f° r 3 — 
1, 2, • ■ • , M and constrain each pair to remain in contact. 
The statistical weight of the configurations is now given 
by 



M 



(2) 



Fig. [l]a depicts schematically a simple case of such a poly- 
mer, with dashed lines connecting the linked monomers. 
The same chain is redrawn in Fig. |l|b in such a way that 
the paired monomers are placed at the same point in 
space. 
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FIG. 1. (a) A schematic drawing of a chain with two linked 
pairs; (1,5) and (4,7). The circles represent monomers, the 
solid lines depict bonds between neighboring monomers, and 
the dashed lines connect pairs of monomers which are forced 
to remain in contact, (b) The same chain as in (a), but with 
every pair of linked monomers depicted by a single circle. 

The spatial extent of the polymer can be character- 
ized by the thermal average (with weight P) of the 
squared end to end distance r 2 = ((fo — r/v) 2 ), or by 
the squared radius of gyration R 2 g which is the average of 

rf m = ((ft — r m ) 2 ) over all pairs of monomers {l,m}. We 
note that the fluctuations of the polymer in each of the d 
perpendicular space directions is independent of the oth- 
ers. Thus r 2 is simply the sum of d contributions which 
can be calculated separately by considering one dimen- 
sional chains. Therefore, without loss of generality we 
restrict ourselves to a one dimensional Gaussian chain. 

It is convenient to recast the problem in terms of a 
resistor network: Mathematically, calculation of r 2 for a 
Gaussian polymer with constraints is identical to finding 
the resistance of a chain built of elementary resistors with 
conductances k, in which the pairs of sites /cj 2 '' j 

are connected by zero resistance links (shorts). Thus, 
Fig. [I] may also represent an electrical circuit, where the 
dashed lines indicate the shorts. This exact correspon- 
dence [|llj holds for any arbitrary Gaussian network: The 
dissipation on a link ab is given by K a b(4> a — <Pb) 2 , where 
K a i, is the link conductance and <p a is the potential on a 
node. This is analogous to the appropriate elastic term 
in the Gaussian Hamiltonian. The resistance between 
any two nodes j and k is calculated by minimizing the 
overall dissipation under applied external voltage. For a 
quadratic form this minimization is equivalent to calcu- 
lating averages of (<f>j — cf>k) 2 with the Gaussian weight 
built using this form. Therefore, our task is reduced to 
calculating the resistance of networks such as the one in 
Fig. ^b. In the remainder of the paper we shall use the 
terminology of the Gaussian polymer and of the resistor 
network interchangeably. 

Note that k, which is the force constant for the Gaus- 
sian chain or the conductance of an elementary resistor, 
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appears only as a overall prefactor (1/k) in calculations 
of r 2 or resistance. Thus without loss of generality we 
set k = 1, making these quantities dimensionless. Cal- 
culating the resistance of a chain with a specified set 
of links is now accomplished using elementary methods: 
The configuration is first recast in the form of a simple 
electrical network, as in the process leading from Fig. |l|a 
to Fig. [j]b. At this point each resistor is assigned a unit 
resistance. Pairs of resistors which are in series or parallel 
are replaced by effective resistors. Repeated application 
of this process leads to a network of not more than M 
nodes connected by effective resistors. (For low densities 
of shorts, the number of nodes is much less than M. For 
example, reduction by series and resistor rules is sufficient 
to completely eliminate all internal nodes in Fig. |l|b.) Fi- 
nally, the resistance of the reduced network is calculated 
by solving a system of linear equations. The number of 
unknowns (and equations) is of order of (or smaller than) 
M , and thus much smaller than N. Therefore, for each 
N and M, we could easily average our numerical results 
over large numbers (up to 1600) of configurations with 
randomly distributed links. 



III. NEARBY LINKS 

We start by considering links that join monomers that 
are close-by along the chain. As a simple example con- 
sider a very long chain (N 3> 1) with M contact pairs 
scattered randomly along the chain: The position k^p of 
the first monomer of the jth pair is chosen with uniform 
probability anywhere along the chain, while the second 
member of the pair is located at k^ — ki + £, i.e. the 
distances between any two members of a pair (measured 
along the backbone) are fixed at I. (We assume that 

For the corresponding electrical circuit, it is obvious 
that the total resistance is proportional to N (because 
the shorts are local). The problem is characterized by 
the density n = M/N, and "coverage" c = n£ of the 
links. As long as the coverage is small (c <C 1), the 
different pairs do not bridge over overlapping strands 
of the chain; the total resistance is obtained simply 
by removing the part of the chain that is shortened as 
r 2 = N - Ml = (1 - c)N w e~ c N. 

When c is comparable or larger than unity, the re- 
sistance of the chain drops significantly. However, it can 
still be bounded from below by the total resistance of the 
resistors which are not bridged by the shorts. An "un- 
bridged resistor" is such that there are no shorts which 
begin to its left and end to its right. Since the proba- 
bility of such a condition for each resistor is (1 — n) e , a 
lower bound on the resistance is given by (1 — n) e N. The 
continuum limit (where the discreteness of the chain can 
be disregarded) is reached when n -C 1 and f <C I <C N. 
In this limit the bound becomes e~ c N. (Note that c does 
not have to be small.) This lower bound shows that the 



chain is not collapsed for any coverage, i.e. r 2 ~ N, al- 
though the prefactor may be very small when the links 
are dense. 

The results of Monte Carlo simulations on this model 
are depicted in Fig. ^|. In the continuum limit, the re- 
sistance is expected to have the form g(c)N , where g is 
some unknown function. The collapse of the data for 
different values of £ = 8, 32, and 128 confirms this expec- 
tation. There are slight systematic deviations for I = 8 
which are due to the discreteness of the chain. The lower 
bound of e~ c is indicated by the solid line in this figure, 
and is quite a good estimate for small values of coverage. 
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FIG. 2. Normalized resistance, or r 2 , of a chain of 
N = 2560 monomers, for links of fixed lengths £ = 8 (tri- 
angles), £ — 32 (squares), and £ = 128 (circles) as a function 
of the coverage c = Mi/N. 

There are close analogies between random resistor net- 
works and percolation. In particular, the collapse of the 
Gaussian chain can be related the presence of infinite 
cluster of shorts. In the corresponding percolation prob- 
lem, long-range shorts are added to a one-dimensional 
system II4] in uncorrelated fashion. The results of this 
section were obtained for the simple case where the shorts 
connect points at a fixed distance £. However, the conclu- 
sions can be generalized to the case when £ is randomly 
distributed with a probability p{£) . 

An important characteristic of long-range percolation 
is the coverage c = YITLi £p(£)- As long as c is finite, the 
shorts do not form an infinite cluster |1^] . The resistance 
(or r 2 ) is then proportional to N, with a lower bound of 
e~ c N. Power-law distributions p(£) rs B/£ a (for £ — > 00) 
are frequently used to describe long-range percolation. 
For a > 2 there is a finite c, leading to the situation 
described above. For a < 1, the sum n = X^i^W 
diverges, i.e. the number of contacts per monomer is in- 
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finite. In this case, an infinite clusters of shorts always 
exists, although the situation does not correspond to a 
realistic physical model, since M/N — > oo. A somewhat 
more realistic situation occurs for 1 < a < 2, where it 
can be shown [|l3| that for n < 1/2 no infinite cluster is 
formed. For n > 1/2, depending on the details of p(£) 
(e.g., the value of the constant B), percolation may or 
may not occur juj. This again corresponds to M « N. 



IV. DISTANT LINKS 

The distributions with a < 2 cannot be directly used 
to describe the behavior of a finite polymer, because the 
divergence of c implies the presence of strong finite size 
effects. We shall, therefore, consider the extreme case of 
a broad distribution of £ by assuming that for finite N the 
typical £ is of order of N. The simplest possible situation 
is obtained when kj and of the jth link are selected 
independently and uniformly among all monomers. We 
are interested in calculating r 2 for such a chain with M 
links. 

We immediately notice a simple scaling argument: 
Consider a chain of length TV with M links located at 

Ifcj 1 ', fcj 2 '|, and compare it with another chain of A7V 

monomers with links at |Afcj 1 ' ) , Afcj 2 ' |. Clearly, r 2 of 

the latter sequence is exactly A times larger than the 
former. (We consider the limit N ^> 1 where the dis- 
creteness effects can be disregarded.) The corresponding 
probabilities of finding such randomly linked chains are 
identical. (More precisely, the probability for a link in 
the first ensemble to be located between x and x + dx 
is equal to the probability for a link in the second en- 
semble to be located between Xx and X(x + dx).) From 
the relation r 2 M XN — Xr 2 M N , it immediately follows that 
r 2 = f(M)N. To obtain a collapsed state of the chain 
we must have f(M) ~ 1/N. We shall show that such 
a small value of / is reached only when TV ~ M, when 
essentially every monomer is paired with another. 

We first obtain a lower bound for r 2 in this case. The 
M shorts break the chain backbone into 2M + 1 seg- 
ments. The resistance of the chain is certainly larger 
than that of the two extremal segments at its two ends. 
In Fig. ||, this corresponds to distance between and 1, 
plus the distance between 7 and 9. In the limit of large 
M, each segment is independently taken from an expo- 
nential probability distribution (see Sec. ^) with mean 
size N/(2M + 1). The resistances of the two end seg- 
ments thus add up to 2JV/(2M + 1) « N/M. There- 
fore, for large M we have f(M) = r 2 /N > 1/M. Note, 
that this bound ensures the absence of a collapsed state 
for vanishing density M/N , contradicting a prediction of 
Ref. @. 

We were unable to derive a satisfactory upper bound 
for f(M). It can be crudely argued that it is bounded 
from above by (In M)/M: The resistance of the chain 



should be smaller than that of a single path going from 
one end of the chain to the other, either by the way of 
links or through the shorts. Since (for small M) the num- 
ber of different L-step paths (on the space of 2M + 1 
segments) beginning from monomer 0, increases expo- 
nentially with L, the probability that at least one of the 
paths reaches the point N becomes of the order of unity 
when L ~ InM. The resistance of the path which has 
reached the end-point can now be estimated by multi- 
plying the resistance of a typical segment, approximately 
N/(2M), by the number of segments, ~ InM. There- 
fore, this overestimate of the resistance is approximately 
TV In M/M. 

We confirmed numerically the scaling form of r 2 for 
several values of N. Fig. |^ depicts (on a logarithmic 
scale) the scaled resistance / = r 2 /N, as a function of 
M, for a single value of N — 2560. Every point on this 
figure represents an average over 1600 configurations of 
random links. The numerical results gradually converge 
to a slope of —1, as depicted by the solid line. A least 
squares fit to all points of the figure produces a slope of 
0.97, and the curve cannot be fitted as In M/M. In fact, 
we conclude that f(M) ~ 1.5/M; with a prefactor that 
is surprisingly close to the value of 1 which appears in 
our simple lower bound. 
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FIG. 3. Logarithmic plot of the scaled resistance r 2 /N, as 
a function of number of links M. Each point represents an 
average over 1600 randomly linked chains. The slope of the 
solid line is — 1. 



V. DIRECTED POLYMERS 

In this section we introduce a simpler model in which 
a random set of monomers is linked to the origin. Alter- 
natively, the model describes a directed polymer in d + 1 
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dimensions, in which certain certain points are linked to 
a line at the origin. In the latter example, denotes 
the transverse coordinate of the ith monomer. The M 
randomly chosen monomers {kj}, for j = 1, 2, • • • , M are 
linked to the origin, resulting in the statistical weight, 



M 



P[{n}\ oc exp-^ (r kj ). 



(3) 



The constraints break the chain into M+l independent 
segments of lengths {s a }, with a = 1, 2, • • • , M+ 1. This 
model is easily solvable in the limit of large M, where 
it is sufficient to apply the constraint J2 a Sa ~ ^ on ty 
on average. Subject to this constraint, the joint proba- 
bility of segment lengths is maximized for a product of 
independent exponential distributions 



M + l 
P{s) = N exp 



M + l 

N ' 



(4) 



For each segment, we thus have (s m ) « m\(N / M) m . 

The end to end distance in this case is given sim- 
ply by the contribution of the two end segments, and 
^?' 2 ) = 2N/M. We can also calculate an average of the 
squared distance from the origin, 



R 2 



N M+l 



(5) 



i=0 



a=l 



where we have taken advantage of the independence of 
segments. It is easy to show that each segment con- 
tributes r(s) 2 = s 2 /6, resulting in 



1 



(6) 



(We note that R 2 does not coincide with R 2 , but differs 
from it only by a term of order N/M 2 , which becomes 
negligible for large M.) We can also consider cases where 
the chains are non-ideal, such as directed polymers in a 
random medium ]l5| ]. In such cases, each segment wan- 
ders away from the origin by an amount s v with v > 1/2. 
The overall radii of the randomly linked polymer are then 
characterized by the scale {N/M) v . 



s Ri N/M, where v is the swelling exponent. The numer- 
ical results of Sec. [V, and the analytical forms of Sec. [v|, 
suggest that this bound is generally satisfied. We thus 
conjecture that the typical size of a self-avoiding chain 
with M random links is given by 



{R} <x 



(7) 



Assuming the validity of the above conjecture, we can 
ask how many links are necessary to compactify a chain. 
An ideal chain in its compact state is localized to a region 
of size R ~ 0(1). This is achieved only with an exten- 
sive number of links M oc N. For a self-avoiding chain, 
however, the compact state has finite density, and hence 
R oc N 1 ^. Comparing with Eq. (Q), suggests that such 
compactification is achieved if the number of links scales 
as 



AfociV with (j)=l-—. 

dv 



(8) 



For a self-avoiding polymer in d = 3, <j> » 0.43, and 
d> = 1/3 in d = 2. (While this certainly gives the min- 
imum number of bonds necessary for collapse, its suffi- 
ciency remains to be established.) 

The above result suggests that it is much easier to com- 
pactify a self-avoiding polymer. However, it says very lit- 
tle about the final structure of the compact state. From 
the perspective of protein folding studies || , the resulting 
state is most likely a compact globule; a liquid-like state 
with extensive entropy. Additional links would then be 
needed to freeze this compact object into a unique con- 
figuration. The radius of gyration is then not a good 
discriminator of the state of the macromolecule. 

At the completion of this work, we became aware of a 
recent paper by Solf and Vilgis JTJ| . Their starting point 
is a randomly crosslinked Gaussian network, motivated 
by the Deam and Edwards Q model of polymeric gels. 
Although they consider more general networks, they also 
perform simulations on the model of Sec. IV. They mea- 
sure the radius of gyration, finding R 2 w 0.26A/M, con- 
sistent with our results. Interestingly, the ratio of R 2 to 
r 2 (calculated in our paper) is approximately 1/6, as in 
the case of an unconstrained ideal chain. 



VI. DISCUSSIONS 

Most of the results described so far apply to ideal 
chains. However, we may argue that some of conclusions 
are expected to hold for self-avoiding polymers. Con- 
sider, for example, the lower bound obtained in Sec. IV. 
Even for a self-avoiding walk, it is reasonable to expect 
that the overall polymer size is larger than the contribu- 
tions of its two end segments. Although the interactions 
amongst the segments make an exact argument difficult, 
it is probably reasonable to assume that each end seg- 
ment has a characteristic size proportional to s" with 
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